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Title SYSTEM AND METHODS FOR SEARCHING USING QUERIES WRITTEN IN A DIFFERENT 
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has revealed their concordance to the requirements of patentability set forth by the Articles 1349 and 1350 of the 

Civil Code of the Russian Federation and decided to grant the Patent of the Russian Federation for the following 
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(51) IPC 
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1. A method for automatically translating query terms from one language and/or character set to another 
comprising: 

identifying a first set of anchor text written in a first format and containing a given term; 
identifying a set of documents to which the first set of anchor text points; 

identifying a second set of anchor text written in a second format and pointing to the identified set of documents; 
analyzing the second set of anchor text to determine that a representation of the given term in the first format 
corresponds to a representation of the given term in the second format 

2. The method of claim 1 , in which the first format comprises a first character set, and the second format comprises 
a second character set. 

3. The method of claim 1, in which the first format comprises a first language and the second format comprises a 
second language. 

4. The method of claim 1, in which analyzing the second set of anchor text includes identifying a term that appears 
most frequently in the second set of anchor text and designating the most frequently appearing term as the 
representation of the given term in the second format. 

5. The method of claim 1 , in which analyzing the second set of anchor text comprises: 
calculating a probability that the given term corresponds to a term in the second set of anchor text. 

6. The method of claim 5, in which the probability is obtained using at least one of Bayesian methods, histogram 
smoothing, kernel smoothing, and shrinkage estimators. 

7. The method of claim 5, in which the probability that the given term corresponds to a term in the second set of 
anchor text is obtained by dividing the number of occurrences of the term in the second set of anchor text by the 
total number of occurrences of all terms in the second set of anchor text. 

8. The method of claim 1, in which analyzing the second set of anchor text comprises: calculating a probability that 
the given term corresponds to each term in the second set of anchor text. 

9. The method of claim 1 , in which analyzing the second set of anchor text comprises: 
identifying a term that appears most frequently in the second set of anchor text. 

10. The method of claim 2, in which the first format is selected from the group consisting of: romaji, romaja, and 
pinyin; and in which the second character set is selected from the group consisting of: katakana, hiragana, kanji, 
hangul, hanja, and traditional Chinese characters. 
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1 1 . The method of claim 1 , in which the documents comprise web pages. 

12. The method of claim 1, further comprising: 

obtaining a query written in the first format and containing the given term; 

translating the query into the second format based at least in part on said analyzing step; 

searching a database for information written in the second format that is responsive to the translated query. 

13. The method of claim 12 in which the steps are performed in the order recited. 

14. A method for searching information in one format using queries written in another format comprising: 
obtaining a query written in a first format from a user; 

translating the query into a second format using a probabilistic dictionary, the probabilistic dictionary mapping terms 
from the first format to the second format; 

searching a database for information that is responsive to the translated query; and 
returning search results written in the second format to the user. 

15. The method of claim 14, further comprising: 
obtaining search result selections from the user; 

using said search result selections to modify the probabilistic dictionary of term mappings. 

16. The method of claim 15, wherein the modification comprises adjusting at least one probability associated with 
at least one mapping in the probabilistic dictionary. 

17. The method of claim 14, in which the step of translating the query into the second format includes expanding 
the query. 

18. The method of claim 17, in which the expanded query includes alternative encodings of the query terms. 

19. The method of claim 17, in which the expanded query includes alternative language translations of the query 
terms. 

20. The method of claim 17, in which the expanded query, includes alternative encodings and alternative language 
translations of the query terms. 

21. The method of claim 18, in which the expanded query includes synonyms of the alternative encodings of the 
query terms. 

22. A method for creating a probabilistic dictionary, the probabilistic dictionary mapping terms in a first format to 
terms in a second format, the method comprising: for a given term, identifying a first set of data in the first format 
that contains the term; 

identifying a second set of data in the second format that is aligned with the first set of data; and 

analyzing the second set of data to determine one or more probabilities with which the given term maps onto one 

or more terms in the second set of data. 

23. The method of claim 22, further comprising: adding the given term to the dictionary along with one or more 
probabilities with which the given term maps onto one or more terms in the second set of data. 

24. The method of claim 23, further comprising: repeating, for each term to be added to the dictionary, said steps of 
identifying a first set of data, identifying a second set of data, and analyzing the second set of data. 
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25. The method of claim 22, in which the first set of data comprises a first set of anchor text pointing to a set of one 
or more web pages, and in which the second set of data comprises a second set of anchor text pointing to the 
same set of one or more web pages. 

26. The method of claim 22, in which the first set of data comprises a set of text written in a first language, and in 
which the second set of data comprises the same set of text written in a second language. 

27. The method of claim 22, in which the probability with which the given term maps onto a term in the second set 
of data is calculated by dividing the number of occurrences of the term in the second set of data by the total 
number of terms in the second set of data. 

28. The method of claim 22, further comprising: modifying the probability with which the given term maps onto a 
term in the second set of data based, at least in part, on an analysis of a user's selection of search results. 

29. The method of claim 22, further comprising: modifying the probability with which the given term maps onto a 
term in the second set of data based, at least in part, on an analysis of a user's previous queries. 

30. A computer-readable medium, comprising instructions, which when executed by a computer system, are 
operable to cause the computer system to perform acts comprising: 

identifying a first set of anchor text written in a first format and containing a given term; 
identifying a set of web pages to which the first set of anchor text points; 

identifying a second set of anchor text written in a second format and pointing to the identified set of web pages; 
determining a probability that a representation of the given term in the first format corresponds to a representation 
of the given term in the second format. 

31. The computer-readable medium of claim 30, further including instructions, which when executed by the 
computer system, are operable to cause the computer system to perform acts comprising: 

modifying the probability that a representation of the given term in the first format corresponds to a representation 
of the given term in the second format based, at least in part, on an analysis of a user's selection of search results. 

32. The computer-readable medium of claim 30, further including instructions, which when executed by the 
computer system, are operable to cause the computer system to perform acts comprising: 

modifying the probability that a representation of the given term in the first format corresponds to a representation 
of the given term in the second format based, at least in part, on an analysis of a user's previous queries. 

33. The computer-readable medium of claim 30, in which the probability is determined using at least in part one of 
Bayesian methods, histogram smoothing, kernel smoothing, and shrinkage estimators. 

34. A method for translating a query provided in a first language or character set to a second language or character 
set comprising: 

identifying a first body of text written in a first format; 

identifying a second body of text written in a second format, the second body of text being aligned with the first 
body of text; 

creating a dictionary of translations between terms in the first body of text and terms in the second body of text by 
comparing the occurrence of terms in the first body of text with the occurrence of terms in the second body of text. 
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35. A method of claim 34, in which the dictionary of translations includes one or more probabilities associated with 
the translations. 

36. A method of claim 34, in which the first format comprises a first character set and the second format comprises 
a second character set. 

37. A method of claim 34, in which the first format comprises a first language and the second format comprises a 
second language. 

38. A method of claim 34, in which the first body of text comprises anchor text and the second body of text 
comprises anchor text. 

39. A method for performing searches using potentially ambiguous queries comprising: 
receiving a query containing at least one query term written in a first format; 
translating the query term into a plurality of variants written in a second format; and 

using one or more of the variants to search for information written in the second format that is responsive to the 
query. 

40. The method of claim 39, in which the first format comprises a sequence of numbers entered from a telephone 
keypad; and in which the second format comprises alphanumeric text. 

41. The method of claim 39, further comprising: 

obtaining the one or more variants by discarding variants in the plurality of variants that are not part of a predefined 
lexicon. 

42. The method of claim 39, further comprising: 

obtaining the one or more variants by discarding variants in the plurality of variants that contain predefined low- 
probability character combinations. 

43. The method of claim 39, in which the first format comprises alphanumeric text written in a character set 
selected from the group consisting of romaji, romaja, and pinyin; and in which the second format comprises 
alphanumeric text written in a character set selected from the group consisting of kanji, katakana, hiragana, hangul, 
hanja, and traditional Chinese characters. 

44. A method for performing searches using potentially ambiguous queries comprising: 
receiving a numeric query entered from a telephone keypad; 

translating the numeric query into a group of potential alphanumeric translations in a first format; 
discarding potential translations that are determined to include predefined lowprobability character combinations; 
translating the remaining alphanumeric translations from the first format to a second format using a probabilistic 
dictionary; and 

performing a search using the alphanumeric translations in the second format. 

45. The method of claim 44, in which the first format comprises text written in a character set selected from the 
group consisting of romaji, romaja, and pinyin; and in which the second format comprises text written in a character 
set selected from the group consisting of kanji, katakana, hiragana, hangul, hanja, and traditional Chinese 
characters. 
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To the application N<>20061 14696/09 



(54) SYSTEM AND METHODS FOR SEARCHING USING QUERIES WRITTEN IN A DIFFERENT 
CHARACTER-SET AND/OR LANGUAGE FROM THE TARGET PAGES 



ABSTRACT 



(57) The present invention relates to information search and retrieval. A technical result is performing searches using 
queries that are written in a character set or language that is different from the character set or language of the documents 
that are to be searched and providing relevant search results. For this purpose is received a sequence of ambiguous 
information components from a user and translated its into one or more corresponding sequences of less ambiguous 
information components. These sequences of less ambiguous information are provided as an input data to a search engine. 
The search results are obtained from the search engine and are presented to the user. A translation between these 
character sets and/or languages can be performed by examining the use of terms in aligned text. Probabilities can be 
associated with each possible translation. Refinements can be made to these probabilities by examining user interactions 
with the search results. 7 in. cl., 38 dep. cl.; 16 drawings. 
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PEfflEHHE 
o Btmane naTeHTa Ha H3o6peTeHHe 




i 



0004790808 



(21) 3a«BKa Jvfe 2006114696/09(015967) 



(22) flaTa no^ara 3aaBKH 13.09.2004 



B pe3yjiKraTe 3KcnepTH3bi 3a#BKH Ha h3o6pctchhc no cymecTBy ycTaHOBJiCHO, hto 
[ ] 3a)iBJieHHoe H3o6peTeHHe 
[X] 3aflBJieHHaa rpynna H3o6peTeHHfi 

OTHOCHTCH K o6l>eKTaM IiaTCHTHSIX npaB H COOTBeTCTByeT yCJTOBHHM naTCHTOCI30C06HOCTH ? 
npe^CMOTpeHHBIM rpa>K,U,aHCKHM KO^eKCOM POCCHHCKOH OeAepai^HH, B CBH3H c neM 

npHHJiTO pemeHHe o BBiflane naTeHTa Ha H3o6peTeHHe: 

SaioiiOMeHHe no pe3yi[i>TaTaM 3KcnepTH3ti npHjiaraeTC*. 



IIpHJio^ceHHe: Ha 10 ji. b 1 3K3. 



PyKOBOflHTCJIB 



B.n.CPTMOHOB 



4>opivia 3T» Ola 



(21)2006114696/09 

(51) MI1K 

G06F 77/27(2006.01) 

(57) 

1. Cnoco6 aBTOMaTirqecKoro nepeBO^a TepMHHOB 3anpoca H3 o#Horo A3BiKa 
h/hjih Ha6opa chmbojiob b Apyrofi, coflep^KamHH 3TanBi, Ha KOToptix: 
HfleHTH4>HE[HpyiOT nepsoe mho^ccctbo TeKCTa npHBH3Kn ? HanHcaHHoro b 
nepBOM ^opMaTe h coaep^amero flaHHBifi TepMHH; 

H#eHTH(|)^HpyioT mho5K6Ctbo flOKy mchtob ? Ha KOToptie yKa3BiBaeT nepBoe 

MHO^CeCTBO TeKCTa npHBK3KH; 

HfleHTH4)HIJHpyk)T BTOpoe MHO^CeCTBO TeKCTa npHB5I3KH, HanHcaHHoro BO 

btopom 4>opMaTe h yKa3BiBaiomero Ha H#eHTHc})HijHpoBaHHoe MHO^ecTBO 
flOKyMeHTOB; 

aHajiH3HpyioT BTopoe MHo^cecTBO TeKCTa npHB*3KH, htoGbi onpe^ejiHTB, HTO 
npe^cTaBneHHe AaHHoro TepMHHa b nepBOM 4>opMaTe cooTBeTCTByeT 
npeflCTaBJieHHio flaHHoro TepMHHa bo btopom (J>opMaTe. 

2. Cnoco6 no n. 1, b kotopom nepBBiii 4>opMaT co^ep^KHT nepBtra Ha6op 
chmbojiob, a BTopofi (J>opMaT co^ep^cHT BTopofl Ha6op CHMBOJIOB. 

3. Cnoco6 no n. 1, b kotopom nepBbifi ^opMaT coAep^cHT nepBBift ^3bik ? a 

BTOpOH (j)OpMaT COflep^CHT BTOpOH H3BIK. 

4. CnocoS no n. . 1, b kotopom aHajin3 BToporo MHO^cecTBa TeKCTa npHB5i3KH 
BKjnonaeT B "ce6ii HfleHTHcjDHKaiiHio TepMHHa, KOTopBiH nojiBJMeTCfl HanGojiee 
nacTO bo btopom MHO^cecTBe TeKCTa npHB^[3KH, h 06 03HaHeHH e HanGojiee nacTO 
noKBJi^K)m;erocK TepMHHa KaK npeflCTaBjieHM ^aHHoro TepMHHa bo btopom 
4>opMaTe. 

5. Cnoco6 no n. 1, b kotopom aHajiH3 BToporo MHO^cecTBa TekcTa npHB5i3KH 
co^ep^cHT 3Tan ? Ha kotopom: 
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BMHHCJMIOT BepOflTHOCTB TOTO, HTO ^aHHBIH TepMHH COOTBeTCTByeT TepMHHy BO 
BTOpOM MH05KCCTB6 TCKCT3. npHB5I3KH. 

6. CnocoG no n. 5, b kotopom BepoHTHOCTt nojiynaeTca c Hcnojn>30BaHHeM, no 
MeHBinefi Mepe, o^Horo H3 GafiecoBCKHX MeTO^OB, crjia>KHBaHira racTorpaMMBi, 
crJia^KHBaHHK (|)yHKijHH bjihhhhji h oijeHOK coKpam,eHH#. 

7. Cnoco6 no n. 5, b kotopom BepoirraocTB Toro, hto flaHHWH TepMHH 
COOTBeTCTByeT TepMHHy bo btopom MHO^cecTse TeKCTa npHB^3KH 3 nojiynaeTC^ 
nyTeM ^ejieroui KOJinnecTBa Bxo:*c#eHHH TepMHHa bo btopom MHO^cecTBe TeKCTa 

npHBH3KH Ha o6lIl.ee KOJIHHeCTBO BXOECfleHHH BCeX TepMHHOB BO BTOpOM 
MHO^CeCTBe TeKCTa npHB*3KH. 

8. CnocoG no n. 1, b kotopom aHajiH3 BToporo MHO^ecTBa TeKCTa npHB5i3KH 
co^ep^cHT 3Tan 9 Ha kotopom: 

BBIHHCJUIIOT BepO^THOCTB TOTO, HTO #aHHbIH TepMHH COOTBeTCTByeT Ka3KflOMy 
TepMHHy BO BTOpOM MHO^CeCTBe TeKCTa npHBH3KH. 

9. CnocoS no n. 1, b kotopom aHajiH3 BToporo MHO>KecTBa TeKCTa npHBii3KH 
co^ep^cHT 3Tan, Ha kotopom: 

HfleHTH (J)HL[HpyK)T TepMHH, KOTOpBIH nOflBJIflCTCfl HaH6oJiee HaCTO BO BTOpOM 
MHOflCeCTBe TeKCTa npHBJI3KH. 

10. Cnoco6 no n. 2, b kotopom nepBBiH cjwpMaT BBiSnpaeTCJi H3 rpynnti, 
cocTOHmeft K3: poMa,o;3H, poMa^3a h hhhbhhb; h b kotopom BTopoS Ha6op 
chmbojiob BBiGnpaeTC^ H3 rpynnsi, cocTO^men H3: KaH^H, KaTaKaHa, xnparaHa, 

XaHrblJIB, XaH^3a H TpaflHipiOHHBIX KHTaHCKHX CHMBOJIOB. 

1 1 . CnocoG no n, 1, b kotopom flOKyMeHTBi co^epacaT Be6-CTpaHHi{Bi. 

12. Cnoco6 no n. 1, AonojiHHTejibHO coflepacamHH 3Tanbi, Ha kotopbix: 
nojiynaioT 3anpoc, HanHcaHHbiH b nepBOM 4>opMaTe H co^ep^camHH ^aHHtift 

TCpMHH; 

nepeBOflOT 3anpoc bo BTopoft 4>opMaT, no MeHBHiefi Mepe, nacTHHHO Ha ocHOBe 
ynoMJiHyToro 3Tana aHajiH3a; 

HiuyT b 6a3e #aHHbix HH^opMai^Hio, HanncaHHyio bo btopom (|>opMaTe, KOTopaa 
COOTBeTCTByeT nepeBe^eHHOMy 3anpocy, 

13. Cnoco6 no n. 12, b kotopom 3Tani>i BBinojiHtfioTCfl b nepe^HCJieHHOM 
nopiiAKe. 
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14. Cnoco6 noHCKa HH(|)opMau;HH b o^hom (J)opMaTe c hciio jib3 obbhhcm 
3anpocoB, 3anHcaHHBix b ^pyroM 4>opMaTe, coflepacamHH 3Tanbi, Ha kotopbix: 
nojiynaioT 3anpoc ot nojib30BaTejra ? HanncaHHbiH b nepBOM 4>opMaTe; 
nepeBoa^T 3anpoc bo BTopofi 4>opMaT ? Hcnojib3yfl bcpo^thocthbih cnoBapb, npn 
3tom BepoHTHOCTHBifi cuoBapb OTo6pa>icaeT TepMHHti H3 nepsoro 4>opivraTa bo 
BTopoft (J)opMaT; 

HiuyT b 6a3e AaHHbix HH^opMaijHio., KOTopaa cooTseTCTByex nepeBe#eHHOMy 
3anpocy; h 

B03Bpaiu;aioT nojiB30BaTejiio pe3yjibTaTbi noHCKa, HanncaHHbie bo btopom 
4>opMaTe, 

15. Cnoco6 no n, 14, .nonojiHHTejiBHO coflep^camHH 3Tanbi, Ha KOTopbix: 
nonynaioT ot nojib30BaTejui BapnaHTbi BbiGopa pe3yjii>TaTOB noHCKa; 
HcnonB3yiOT ynoMjmyTbie BapnaHTbi Bbi6opa pe3yjn>TaTOB noHCKa ajui 

MOflH<|>HU;HpOBaHH3[ BepOflTHOCTHOIX) CJIOBapK OTOGpa>KeHHH TepMHHOB. 

16. Cnoco6 no n. 15, b KoxopoM MOflH^mcaitHK coflep^cHT KoppeKTHpoBKy, no 
MeHbmeH Mepe, oahoh BeposrrHOCTH, accoiinaTHBHO CB5i3aHHOH, no MeHLUieH 
Mepe, c o^hhm OTo6pa^ceHHeM b Bcp ohthocth om cjiosape. 

17. Cnoco6 no n. 14, b KOTopoM 3Tan nepeBO^a 3anpoca bo BTopofl (J)opMaT 
BKjnonaeT b ce6a pacninpeHHe 3anpoca. 

18. Cnoco6 no n. 17, b kotopom pacnmpeHHbiH 3anpoc BKJUonaeT b ce6* 
ajiBTepHaraBHBie ko^hpobkh TepMHHOB 3anpoca. 

19. CnocoS no n. 17, b kotopom pacuiHpeHHbifi 3anpoc BKJiioHacT b ce6^ 
ajibTepHaTHBHbie #3biKOBbie nepeBOflbi tcpmhhob 3anpoca. 

20. Cnoco6 no n. 17, b kotopom pacniHpeHHbift 3anpoc BKjnonaeT b ce6a 
ajibxepHaTHBHbie ko^hpobkh h ajibTepHaTHBHbie H3biKOBbie nepeBo^bi 
TepMHHOB 3anpoca. 

21. Cnoco6 no n. 18, b kotopom pacniHpeHHbiH 3anpoc BianonaeT b ce6% 

CHHOHHMbI aJIbTepHaTHBHblX KOflHpOBOK TepMHHOB 3anpOCa. 

22. Cnoco6 rjiz co3flaHH5i BepoflraocTHoro cuoBapa, npnneM ynoM^HyTbifi 
BepoOTHOCTHbifi cjiosapb oxoSpa^caeT TepMHHbi b nepBOM 4>opMaTe b TepMHHbi 
bo btopom 4>opMaTe, ynoMjmyTbiH cnocoG co^ep^HT 3Tanbi, Ha KOTopbix: rr% 
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^aHHoro TepMHHa H#eHTHtj>HU[HpyK)T nepBoe mho^kcctbo ^aHHbix b nepBOM 
c|)opMaTe ? KOTopoe coflep^cHT TepMHH; 

HfleHTK(J)Hi;npyK)T BTOpOe MHO^CeCTBO flaHHBIX BO BTOpOM (])OpMaTe, KOTOpoe 
BBipOBHCHO C IICpBBIM MHO^ECeCTBOM flaHHBIX; H 

aHajiH3HpyiOT BTopoe MHoacecTBO flaHHBix, hto6bi onpeAenHTB oflHy hjih 6ojiee 
BepoHTHOCTefl, c KOToptiMH ^aHHbiH TepMHH OTo6pa>KaeTca b o^hh hjih 6ojiee 

TepMHHOB BO BTOpOM MHO^KCCTB6 ^aHHBIX. 

23. Cnoco6 no n. 22, aonojirorrejiBHO coflep^camHH 3Tan, Ha kotopom: 
flo6aBJi3iK)T ^aHHBiH TepMHH b cjiosaps BMecTe c o^HOH hjih 6onee 
BepoiiTHOCTHMH, c KOToptiMH flaHHBift TepMHH OTo6pa}KaeTCfl b oflHH hjih Sojiee 

TepMHHOB BO BTOpOM MHO^CeCTBe AaHHBIX. 

24. Cnoco6 no n. 23, #onojiHHTejiBHO coaep^camHH 3Tan, Ha kotopom: 

nOBTOpHIOT flJIfl Ka^KAOrO TepMHHa, KOTOpBIH Hy^CHO A06aBHTB B CJIOBapB, 

ynoMJiHyTBie 3Tani>i H^eHTH^HKaijjHH nepBoro MHoacecTBa ashhbix, 

HAeHTH(|)HKaiJ[HH BTOpOTO MH05KeCTBa flaHHBIX h aHanH3a BToporo MHO^KeCTBa 
flaHHBIX. 

25. Cnoco6 no n. 22, b kotopom nepBoe MHo^cecTBO AaHHLix co^ep^cHT nepBoe 

MHO^CeCTBO TeKCTa npHB^3KH, yKa3BIBaK)merO Ha MHO>KeCTBO H3 o^hoh HJIH 

6onee Be6-CTpaHHn,, h b kotopom BTopoe MHo^cecTBO ^aHHBix co,nep^KHT BTopoe 

MHO^CeCTBO TeKCTa npHB^3KH, yKa3BIBaK)merO Ha TO »Ce MHO!>KeCTBO H3 OflHOH 

hjih 6ojiee Be6-CTpaHHii,. 

26. Cnoco6 no n. 22, b kotopom nepBoe mho>kcctbo #aHHBix co^ep^cHT 
MHO^cecTBO TeKCTa, HanncaHHoro Ha nepBOM 5i3BiKe, h b kotopom BTopoe 
MHO^cecTBO AaHHBix coflep^cHT to ace MHO)KecTBo TeKCTa, HanncaHHoro Ha 

BTOpOM K3BIKe. 

27. Cn0C06 no n. 22, B KOTOpOM BepOflTHOCTB, C KOTOpOH ,ZjaHHBIH TepMHH 
OT06pa^CaeTCH B TepMHH BO BTOpOM MHO>KeCTBe AaHHBIX, BBIHHCJISeTCJf 

nocpe^CTBOM ^ejieHHfl KOJinnecTsa bxo^cachhh TepMHHa bo BTopoM MHoncecTse 

AaHHBIX Ha oSmee KOJIHH eCTBO TepMHHOB BO BTOpOM MHO^BCeCTBe aaHHBix. 

28. Cnoco6 no n. 22, #onojiHHTejiBHO co£ep:acamHH 3Tan, Ha kotopom: 

MO^H^HIJiHpyiOT BepOKTHOCTB, C KOTOpOH ^aHHBIH TepMHH OT06pa3KaeTC5I B 
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TepMHH BO BTOpOM MHO^CeCTBe flaHHBIX, no MeHBUieft Mepe ? HaCTHHHO Ha 

ocHOBe aHanH3a nojiB30BaTejiBCKoro BBiGopa pe3yjn>TaTOB noncKa. 

29. Cnoco6 no n. 22, .zjonojiHHTejiBHO co^ep^caEqufl 3Tan, na KoropoM: 

MOAH(^HIIHpyiOT BepOflTHOCTB, C KOTOpOH ^aHHBIH TepMHH OTOGpa^CaeTCK B 
TepMHH BO BTOpOM MH03KeCTBe #aHHBIX, no MeHBHieH Mepe ? HaCTHHHO Ha 

ocHOBe aHajTH3a npe^Bmymnx nojn>30BaTejiBCKHX 3anpocoB. 

30. MaiHHHOHHTaeMMH HOCHTeJIB, COaep)KamHH KOMaHflBI, KOTOpBie, GypyHH 
HCnOJIHeHHBIMH BBIHHCJIHTeJIBHOH CHCTeMOH, pa6oTaK)T, HT06BI 3aCTaBHTB 

BBiHHCJiHTejiBHyio CHCTeMy BBinojiHHTB ^eHCTBHH, co#ep:>KamHe: 
Hfl;eHTH(^HKau;HK) nepBoro MHO^cecTBa TeKCTa npHBH3KH, HanncaHHoro b 
nepBOM 4>opMaTe h coflepacamero ^aHHBiii TepMHH; 

HAeHTH^HKaijHK) MHO^KecTBa Be6-CTpaHHU„ na KOTopBie yKa3BisaeT nepBoe 

MHO^CeCTBO TeKCTa npHBK3KH; 

H^eHTH(J>HKaii;Hio BToporo MHO^ecTBa TeKCTa npHBfl3KH, HanncaHHoro BO 
btopom (J>opMaTe h yKa3BiBaK)m;ero Ha H^eHTH(J)Hii|HpOBaHHoe MHO^cecTBo Be6- 
CTpaHHii;; 

onpeaejieHHe BepoaraocTH toto, hto npe^CTaBJieHHe flaHHoro TepMHHa b 
nepBOM <J)opMaTe cooTBeTCTByeT npeACTaBjieHHio #aHHoro TepMHHa bo btopom 
4>opMaTe. 

3L MaHIHHOHHTaeMBIH HOCHTeJIB no n. 30, flOnOJIHHTeJIBHO BKJIIOHaiODJtHH B 
Ce6H KOMaH^BI, KOTOpBie, 6yflyHH H CnOJIHeHHBIMH BBIHHCJIHTeJTBHOH CHCTeMOH, 
pa60TaK)T ? HTO6BI 3aCTaBHTB BBIHHCJIHTeJIBHyK) CHCTeMy BBinOJIHHTB fleftCTBHfl, 

co^ep^camHe: 

MOflH(J)HKaii;Hio BepoflTHOCTH toto, hto npeACTaBJieroie flaHHOTO TepMHHa B 
nepBOM (J)opMaTe cooTBeTCTByeT npe/jcTaBJieHHio Aamioro TepMHHa bo btopom 
4>opMaTe, no MeHBHieH Mepe, nacTHHHo Ha ocHOBe aHanH3a nojiB30BaTejiBCKoro 
BBi6opa pe3yjiBTaTOB noncKa. 

32. ManiHHOHHTaeMBiH HOCHTejiB no n. 30, flonojiHHTejiBHO BKjnonaiomHH b 

Ce65I KOMaHflBI, KOTOpBie, 6yAyHH HCnOJIHeHHBIMH BBIHHC JIHTe JIBH OH CHCTCMOH, 
pa6oTaK>T, HT06BI 3aCTaBHTB BBIHHCJIHTeJIBHyiO CHCTeMy BBinOJIHHTB AeHCTBHtf, 

coAepacamHe: 
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MOflHC^HKaijHK) BepojrraocTH toto, hto npeflCTaBJiemie ^aHHoro TepMHHa B 
nepBOM 4>opMaTe cooTBeTCTByeT npeflcxaBJieHHio saHHoro TepMHHa bo btopom 
4>opMaTe, no uenhuien Mepe, nacTHHHO Ha ocHOBe aHajnrea npe^BiflymHx 
nojii>30BaTejii»cKHX 3anpocoB. 

33. MaiHHHOHHTaeMBIH HOCHTeJIB no n. 30, B KOTOpOM BepOilTHOCTB 

onpeAejifleTCji c Hcnojn>30BaHHeM, no MeHBUien Mepe, ^acTHHHO o^Horo H3 
SaiiecoBCKHX MeTOflOB, crjia^KHBaHiM rncTorpaMMBi, cm a>KHB aHHs (JjyHKijHH 

BJIH5IHH5I H Ol^CHOK COKpameHHfl. 

34. Cnoco6 nepeBo^a npe^cTaBJieHHoro Ha nepBOM Ji3BiKe hjih Ha6ope 
chmbojiob 3anpoca Ha BTopoS ;i3bik : hjih Ha6op chmbojiob, coAep^camHH 
3xanbi, Ha KOToptix: 

HzteHTH^Hi^HpyioT nepByio nacTL TeKCTa, HanncaHHyio b nepBOM <|>opMaTe; 
HAeHTH(J)Hi^HpyioT BTopyro nacTt TeKCTa, HanncaHHyio bo btopom <|)opMaTe, 
BTopyio nacxL TCKCTa, BBipaBHHBaeMyio c nepBOH nacxBio TeKCTa; 
co3,a;aiOT cjioBapB nepeBO^OB Me^K^y tcpmhh aMH b nepBOH nacra TeKCTa h 

TCpMHHaMH BO BTOpOH HaCTH TeKCTa nOCpe#CTBOM CpaBHeHHSI BXOECfleHEUI 
TepMHHOB B nepBOH HaCTH TeKCTa C BXO^KfleHHeM TepMHHOB BO BTOpOH HaCTH 

TeKCTa. 

35. Cnoco6 no n. 34, b kotopom cjioBapt nepeBO^OB BKjnonaeT b ce6# o#ny 
hjih 6ojiee BepoOTHOCTeH, accoijnaTHBHO CBJi3aHHBix c nepeBo^aMH. 

36. Cnoco6 no n. 34, b kotopom nepBBiH 4>opMaT co^ep^cHT nepBtift Ha6op 

CHMBOJIOB, a BTOpOH (|>OpMaT COAep)KHT BTOpOH Ha6op CHMBOJIOB. 

37. CnocoG no n. 34, b kotopom nepBBiH 4>opMaT coflep^KHT nepBBiH #3Bik, a 

BTOpOH <|>OpMaT CO^ep^KHT BTOpOH 5I3BIK. 

38. Cnoco6 no n. 34, b kotopom nepBa^ nacTB TeKCTa co^ep^cHT TeKCT 

npHBH3KH, H BTOpa^ HaCTB TeKCTa COflep^CHT TeKCT npHBH3KH. 

39. Cnoco6 rjisi BBinojmeHHJi noncKOB c HcnojiB30BaHHeM noTeHiinajiBHO 
Heo^H03HaHHBix 3anpocoB, co^ep^camHH 3Tanti, Ha KOToptix: 

npHHHMaioT 3anpoc, co,n;ep}Kaii];HH, no MeHtuieH Mepe, oahh TepMHH 3anpoca, 
HanHcaHHBiH b nepsoM 4>opMaTe; 

nepeBO^HT TepMHH 3anpoca bo mho^ccctbo BapnaHTOB, HanncaHHBix bo btopom 
4>opMaTe; h 
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HcnojiB3yioT oflHH hjih 6onee BapnaHTOB #jm noHCKa HH^opMaijHH, 
HanHcaHHofi bo btopom 4>opMaTe, KOTopa* jiBjraeTCJi otbcthoh na 3anpoc. 

40. Cnoco6 no n. 39, b kotopom nepBBiH 4>opMaT co^ep^cHT 
nocjie^oBaTejibHOCTB iinc^p, bbc^chhbix c KJiaBHumoH naHejiH Tejie4>OHa; h b 

KOTOpOM BTOpOH <|)OpMaT COflepflCHT GyKBeHHO-IJHC^pOBOH TeKCT. 

41. CnocoG no n. 39, flonojiHHTenBHO co#ep;)KamHH 3Tan, Ha kotopom: 
nojiynaioT o^hh hjih 6ojiee BapnaHTOB nocpe^cTBOM OT6pacBiBaHira BapnaHTOB 
b MHO^KecTBe BapnaHTOB, KOToptie He flBjunoTca *iacTBK> npeAonpeflejieHHoro 

JieKCHKOHa. 

42. Cnoco6 no n. 39, AonojiHHTejiBHO co^ep^caniHH 3Tan, Ha kotopom: 
nojiynaioT o^hh hjih 6ojiee BapnaHTOB nocpe^cTBOM OT6pacBiBaHH5i BapnaHTOB 
b MHO^cecTBe BapnaHTOB, KOTopBie coAep^caT npeflonpeflejieHHBie 
ManoBepoHTHbie coneTaHiw chmbojiob. 

43. Cnoco6 no n. 39, b kotopom nepBBiii 4>opMax co^ep^KHT 6yKBeHHO- 
ijHcj^poBOH TeKCT, HanncaHHBiH b HaGope chmbojiob, BBiSpaHHOM H3 rpynra>i, 

COCTO^mefi H3 pOMa^3H, pOMa#3a H nHHBHHb; H B KOTOpOM BTOpOH (|)OpMa.T 

co^ep^cHT 6yKBeHHO-u;H4)poBOH TeKCT, HanHcaHHbiH b Ha6ope chmbojiob, 
Bti6paHHOM H3 rpynnBi, cocToameft H3 KaHA3H, KaTaKaHa, xnparaHa, xaHTBuiB, 

XaH^3a H TpaflHIJHOHHBIX KHTaHCKHX CHMBOJIOB. 

44. Cnoco6 ajm BBinojiHemiji noHCKOB c HcnojiB30BaHHeM noTeHU,HajiBHO 
Heo^HOSHa^iHbix 3anpocoB, co^ep^camHH 3TanBi, Ha kotopbix: 

npHHHMaioT ijH(j)poBOH 3anpoc, BBe^eHHBifi c KJiaBHiiiHOH naHejiH rene^ona; 
nepeBOfl^T ijh^poboh 3anpoc b rpynny noTeHH,HajiBHBix 6yKBeHHO-u;H(|)poBBix 
nepesoflOB b nepBOM 4>opMaTe; 

OTGpacBiBaiOT noTeHijHajiBHBie nepeBO#Bi, KOTopBie onpeflejunoTCfl KaK 
BKJiioHaiomHeTfenoBepoHTHBie coneTaHHii chmbojiob; 

nepeBO^T ocTaBinnecji 6yKBeHHo-iiH(|>poBBie nepeBO/jBi H3 nepsoro <|)opMaTa 

BO BTOpOH (|)OpMaT, HCnOJIB3yfl BepOJITHOCTHBIH CJIOBapB; H 

BBinojiHJiiOT noHCK, HcnojiB3yji 6yKBeHHO-n,H4)poBBie nepeBO^Bi bo BTopoM 
4>opMaTe. 

45. Cnoco6 no n. 44, b kotopom nepBBiH (|)opMaT co#ep>KHT TeKCT, HanHcaHHBifi 
b Ha6ope chmbojiob, BBi6paHHOM H3 rpynnBi, cocTo^meS H3 poMaA3H, poMa#3a 
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H riHHbHHb; H B KOTOpOM BTOpOH (j)OpMaT COflep^KHT T6KCT, HailHCaHHBIH B 

Ha6ope chmbojiob, BBi6paHHOM H3 rpyniiBi, cocTO^mefi H3 KaHA3H, KaTaicaHa, 
XHparaHa, xaHrbiJiB, xaH£3a h TpaflHitHOHHBix KHTaficKHx chmbojiob. 
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ripH ny6nHKaipiH CBe^eHHH o BBi^ane naTeHTa 6yAeT HcnojiB30BaHO oroicaHHe 
b nepBOHanajibHOH pe^a^HH 3aflBHTejm. 

ripH nyGnHKaujiH CBeAeHHH o BBi^ane naTeHTa 6y^yT HcnonB30BaHBi 
nepBOHanajiBHtie nepTe^cH. 
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npHJioaceirae 



K sa^BKe Jfe 20061 14696/09 

(54) CHCTEMbI H CIIOCOEbl ^JIH ITOHCKA C HCnOJIt30BAHHEM 
3ATIPOCOB, HAITHCAHHLIX HA 313LIKE H/HJIH HABOPE CHMBOJIOB 
OTJMHHOM OT TAKOBOrO J\J1R UpJlEBtlX CTPAHEDLJ, 

Pe^epaT 

(57) H3o6peTeHHe othochtch k noHCKy h BBi6opice HHcjwpMaijHH. 
TexHHHecKHM pe3yjii>TaTOM ABjraeTCfl oGecneneHHe bo3mo>khocth bbhiojihchim 
noHCKa c Hcnojib30BaHHeM 3anpocoB, HanncaHHBix b Ha6ope chmbojiob hjth 

JI3BIKe ? KOTOpBIH OTJIHHaeTCH OT HaGopa CHMBOJIOB HJIH JI3BIKa AOKyMeHTOB, 

KOTOpMe Heo6xoAHMo HaiiTH h nojiyneHiM pejieBaHTHBix p C3y JiBTaTOB noHCKa. 

JijiSL 3TOrO npHHHMaiOT nOCJieflOBaTejIBHOCTB HeOflH03HaHHBIX KOMIIOHCHTOB 

HH(|)opMamiH ot nont30BaTejM h nepeBOA^T b o,zi;Hy hjih Gojiee 
cooTBeTCTByiomHe nocjie^OBaTejibHOCTH Menee Heo£H03HaHHBix komxiohchtob 
HH<J)opMau;HH. 3th nocjie,a;oBaTejiBHOCTH MeHee Heo,n;H03Ha*iHOH hh dip opMau^HH 
npeaocTaBJiKK)TCJi Kax BxoAHBie flaHHBie B noHCKOByio ManiHHy, Pe3yjiBTaTBi 
noHCKa nojiy^iaioTC^ ot iiohckoboh MauiHHBi h . npe^cTaBJiKioTC^ 
nojiB30BaTejiio. IlepeBOA Me^y sthmh Ha6opaMH chmbojiob h/hjth Ji3BiKaMH 

MO^CeT 6BITB BBHIOJIHCH nOCpeflCTBOM HCCJieflOBaHHil HCIIOJIB30BaHIW TepMHHOB 
B BBipOBHeHHOM TdCCTC, BcpOKTHOCTH MOryT 6BITB aCCOIJHaTHBHO CBfl3aHBI C 
Ka^flBIM B03M03KHBIM nepeBOflOM. K 3THM BCpOJITHOCT5IM MoryT 6bitb c^ejiaHBi 
yTOHHeHH5J nOCpeflCTBOM HCCJieflOBaHHH B3 aHMO ^eHCTBHH nOJIB30BaTeJIH c 

pe3yjiBTaTaMH noHCKa. 7 h.ii.c|)-jibi h 38 3.il4>-jibi ? 16 hji. 
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